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Abstract — We introduce a class of anchoritic sensor networks, 
where communications between sensor nodes is undesirable or 
infeasible, due to, e.g., harsh environment, energy constraints, or 
security considerations. 

Instead, we assume that the sensors buffer the measurements 
of the physical phenomena over the lifetime, and report measure- 
ments directly to a sink (note that "report" does not necessarily 
require direct communication). Upon retrieval of the reports, all 
sensor data measurements will be available to a central entity 
for post processing. 

Our algorithm is based on the further assumption that some 
of the data fields that are being observed by the sensors can 
be modeled as a local (i.e. having decaying spatial correlations) 
stochastic process; if not, then choose another auxiliary field, 
such as hydroacoustic noise, cloud shadows cast on the ground, 
or animal heat. The sensor nodes record the measurements, or a 
function of the measurements, e.g., "1" when the measured signal 
is above a threshold, or "0" otherwise. These sensors generated 
time-stamped sequences are ultimately transferred to the sink. 
The localization problem is then approached by analyzing the 
correlations between these sequences at pairs of nodes. 

As engineering ramifications to the idea of anchoritic sensors, 
we discuss the localization scheme for sensors deployed on the 
seabed, where radio signals are strongly attenuated in sea water 
within feet of their transmission; also, we discuss a two-tiered 
architecture tasked with combining heterogenous nodes: deaf 
sensors and local masters. 

I. Introduction 

The "coordinate-free" localization problem in sensor net- 
work has attracted significant attention in the literature, see, 
e.g. the survey [1]. The problem is to determine positions of 
the nodes in the network without using absolute reference in- 
formation, like GPS or direction/distance information relative 
to some known beacons. Coordinate-free localization problem 
is therefore to determine the absolute positions of the nodes 
using only the local information, e.g., the internode distances 
or relative directions. This local-to-global localization problem 
presents a serious research challenge and the amount of work 
on it is rapidly growing, see, e.g., [2]— [15]. However, in 
majority of publications, addressing the localization problem 
either assumes extensive internode communications, i.e., a 
bidirectional exchange of signals which is used to infer the 
pairwise distances; or assumes some variant of a system of 
beacons that can send globally structured calibrating signals 
("poor man's GPS"). 

Within the framework of internode communications, one 
uses one or a combination of RSS (received signal strength), 
ToA (time of arrival) and AoA (angle of arrival) data to recon- 
struct the mutual positions of the nodes, and consequently to 



determine their absolute positions. The beacons with known 
positions provide absolute reference points for the remaining 
sensor nodes. 

Approaches with globally broadcasted and centrally steered 
signals that reach a significant fraction of all the sensors can 
also be efficiently used to calibrate locations. For example, 
the Spotlight system [16], with the aid of steerable laser rays 
that sweep over the monitored terrain obtains the locations 
of sensor nodes without equipping them with specialized 
ranging hardware. We note however that this requires precise 
knowledge on the trajectory of the ray. 

In this paper, we look into the networks subject to severe 
communication constraints. In particular, we do not allow 
either internode communications nor any centrally structured 
signals to be used for the localization. 

We decouple the localization problem into two steps: the 
first stage is to recover the internode mutual positions, and 
the second stage is to use the information obtained from the 
first stage to reconstruct the global positions of the nodes. 

Traditionally, the localization problem (in the absence of 
centrally steered signals) implicitly assumes that the relative 
position information should be gained from the signal ex- 
change (chatter) between the nodes, and that the subsequent 
processing of the exchanged signals is necessary. However, as 
we argue below, the requirement that the sensors are able to 
regularly emit signals and process information is undesirable 
in many applied situations, due to, e.g., harsh environment, 
energy constraints, or security reasons. Assuming this for an 
instant, we ask: 

can the localization problem, or rather the first 
stage of the localization problem — recovery of the 
internode distances, be solved under the conditions 
that the sensor nodes do not have the ability to 
exchange signals, and that no global signals are 
applicable for localization ? 

Clearly, the sensors have to be able to gather some measure- 
ments and eventually report them to a sink/processing entity; 
a sensor which is unable to do even that much can be removed 
without any detrimental effect for the network operation. 

We will refer to sensor networks subject to constraints 
of these types (deprived of the ability to chat 1 and lacking 
centrally conttolled broadcasted signals) as anchoritic sensors. 

To be more precise, chatting is a two-way process involving listen and 
talk, while for anchoritic sensors, either the sensors can not listen or they can 
not talk, or even neither. 



It is perhaps counterintuitive that the localization problem 
for anchoritic sensor networks can be solved. Before present- 
ing our approach, we need, however, answer the following 
natural questions: 

1) When and where anchoritic sensor networks are neces- 
sary? 

2) If no communication allowed, how can the measure- 
ments collected by each individual sensors be transferred 
to the sink/center? 

These questions are addressed in the next subsection. In 
the last subsection of the introduction we describe related 
work. The rest of this paper is organized as follows: after 
describing the essentials of our approach to the localization 
problem for anchoritic sensor networks and some engineering 
ramifications, we give several models of the environmental 
random fields in section [HI] These models are candidates 
for recovering the mutual distance data in anchoritic sensor 
networks. Section [V] presents the results of our numeric 
simulations, which are followed by the conclusion. 

A. Motivation 

1 ) Anchoritic sensors: Situations where the sensors collect- 
ing data are anchoritic are much more widespread than one 
might suppose. 

First of all, when the sensor networks are immersed into 
a harsh environment where the communications between the 
sensor nodes is infeasible, standard approach that uses in- 
ternode communication signals to infer the pairwise distances 
no longer applies. As an important application example, we 
consider the sensors deployed on the seabed. These sensors 
would have difficulties using the radio signals to communi- 
cate, because electromagnetic waves are strongly attenuated 
in water. Even using alternative underwater acoustic signals, 
for a large scaled sensor network, is not a good solution, 
since, compared with electromagnetic waves, the latency using 
acoustic signals is typically much higher as sound waves travel 
much slower , and, due to the multi-path propagation and noise 
characteristics, the effective data rates and packet loss is much 
greater as well [17]. Hence one might resort to anchoritic 
assumptions for large scaled underwater sensor networks. 

Also, if the number of sensors grows to 10 5 -10 6 , the cost of 
any extra feature with which the sensors are provided scales 
correspondingly. Though requiring the ability to listen and to 
analyze the received signals increases the cost of a single 
device by only an innocuous amount of expense, it results 
in a prohibitively expensive cost for a large sensor network. 
Hence, the option of only choosing mute or deaf sensors 
that are cheap, or perhaps to combine emaciated sensors with 
expensive and powerful ones, might force the developers to 
adopt the anchoritic requirements. 

Another obvious situation where the sensor chatter is un- 
desirable is the battlefield or other applications involving an 
adversary. Transmitting signals during the mission reveals the 
presence of the sensors, and therefore makes them vulnerable 
to suppression and manipulation. Similarly, one would not 
deploy any globally structured signals for localization purposes 



as an adversary could generate noise or worse, emulate the 
system signals to compromise the localization completely. 

While there are further scenarios where anchortic networks 
could be necessary, these situations — physical obstacles, cost 
considerations, security requirements — seem to cover most 
of them. 

2) Information delivery: Now, if the sensors in the network 
are silent, how can they report the data to the sink? There are 
again several scenarios. 

First, the sensors (or their data storage units) can be 
indirectly collected after their mission is completed. As for 
the sensor networks deployed on the seabed. Buoyant sensors 
can be attached to heavy ballasts and sink themselves. After 
a period of time during which the sensors perform their 
measurements, the ballasts are released and the sensors emerge 
to the surface where their measurements can be collected either 
manually or by radio signals. Clearly, the original positions of 
the sensors cannot be reliably estimated just by their locations 
on the sea surface. 

Another case is relevant to those sensors which can only 
report their collected measurements within a short period of 
time after being activated by extraneous command signals and 
have to keep silent otherwise. For example, sensors serving 
for military or security purposes in some special applications 
should not reveal themselves until the surroundings are safe. 
Here, the sensors are equipped with radios capable of talking. 
However, for the reasons stated above, the communication 
should be on demand and short, e.g., sensors deployed in 
the enemy's rear area. Hence, the sensors transmit just once 
during their operation cycle, transferring to the center (which 
can be, e.g., a mobile agent that passes by) all the information 
they gathered. After that, the sensors either wait for another 
transmission cycle, or even are compromised. 

In either case, the central entity should be able to recover 
the original positions of the sensors depending only on the 
individual measurements by each sensor, and this information 
is oblivious of the positions and the existence of other sensors 
in the network. 

B. Related Work 

As we mentioned above, most of the publications on 
the "coordinate-free" localization problem follow the path 
of reconstructing locations from the proximity data. These 
approaches typically assume the distances being given by RSS 
data, or just by the connectivity patters of the network formed 
by the sensors. Then some geometric considerations are used, 
followed possibly by iterative adjustment and fine tuning. The 
works following to some degree this pattern are, e.g., [2]— [15]. 

Some deviations from this scheme are also considered in 
literature. For example, the work [18] assumes a lack of 
communications between the nodes, yet relies on several 
anchors which can communicate with significant parts of the 
network. The distances to these anchors are then used for the 
localization. 

Similarly, [19] assumes a system of beacons having known 
positions and sending acoustic signals used for the localization. 



A somewhat more complicated approach mixing internode 
chatter and beacons is used in [20]. 

The localization system [16] achieves high accuracy in 
recovering coordinates of the nodes without requiring intern- 
ode chatter. However, some ranging signals (steerable laser 
rays sweeping over the terrain populated with sensors) are 
necessary. 

In the existing literature, the one most close to our tech- 
niques appears to be the ingenious SLAT (simultaneous local- 
ization and tracking) proposal [21]. There, the authors consider 
a network of cameras tracking a moving object by recovering 
their own positions and then the trajectory of the tracked 
object. While conceptually not completely disjoint from our 
model of random walkers (see Section ITlI-C> . the approach of 
[21] relies heavily on the uniqueness of the moving object and 
on the far range of their sensing devices. Introducing many 
targets seems to require a major overhaul of the approach 
used there, which might lead to a statistical procedure of 
distinguishing between multiple targets, and thus to techniques 
close to ours. 

On a more conceptual level, the correlations between the 
measurements have been used in the sensor/ad-hoc networks, 
most notably to develop coding schemes. It has been proposed 
to use correlations in the measurements to improve the net- 
work throughput. Similarly, using correlated signals in ad-hoc 
wireless MIMO networks can improve the transmission rates. 
Here, however, we do not try to filter the noise out of the 
signals, but rather to use the noise (insofar it admits some 
decaying correlation functions) for the localization. 

II. Approach 

We approach the problem of recovering the mutual dis- 
tances between the nodes in an anchoritic sensor network 
by exploiting the time-space correlation structure of some 
random field observed by the sensors. This random field can be 
what the sensors are tasked with measuring, or some auxiliary 
measurements performed with the sole purpose to solve the 
localization problem. 

A. Description and Definitions 

Our algorithm is based on the assumption that the primary 
data field observed by the sensors can be modeled as a locally 
isotropic random field (for a precise definition, please refer to 
the discussion preceding Theorem 1); if not, then another aux- 
iliary physical phenomenon, which is locally stationary, can be 
alternatively measured by the sensor nodes, e.g., hydroacoustic 
noise near the seabed, cloud shadows cast on the ground, or 
even artificially introducing auxiliary signals like broadcasting 
random radio noise mandatorily. The sensor nodes record the 
measurements, or a function of the measurements, e.g., simply 
recording "1" when the measured signal is above a threshold, 
otherwise record "0". Therefore, only sequences of 0, 1 bits as 
well as data measurements will be transferred to the sink. The 
crucial intuitive idea behind this approach is the correlations 
between these sequences at different sensors decrease with 
the distance between the sensors. Then, by analyzing the 



correlation between these boolean sequences at pairs of nodes, 
we can approach the localization problem indirectly. Notice 
that, though each sensor has to reserve a storage space for 
recording the sequence of 0's and l's, this requirement only 
increases a negligible overhead, since even only 250 bytes 
contains 2000 bits. 

More precisely, we assume that a locally stationary random 
field (defined by conditions in Theorem 1) £ is measured by 
the sensors N = {1, 2, . . . , TV} in the network at synchronized 
instants t Q , t\, . . . ,tr- The position of sensor i is denoted as 
Zi — (xi, Hi) 2 - The sensor nodes record "1" when the measured 
signal is above a threshold, otherwise record "0" (this can be 
generalized to a multivalued record). 

Definition 1: For a subset I = {ix, 1%, , . . C N, define 
empirical instantaneous correlation functions to be 

1 T 

K T (/) = -^6 1 fefe(i j )-( is (i j ) ) (1) 
3=1 

where is the record of the field £ by the sensor i at 

instant tj. 

Remark 1: Here the requirement for synchronization is not 
tight, since even when clocks drift over a long period of time, 
a small time lag between the measurements will not affect the 
space-time correlation too much. 

Remark 2: In this paper, we are only interested in analyzing 
the correlation at pairs of nodes, and therefore 5 = 2. For 
5 > 3, it contains more information on the relative positions 
of the sensors (e.g., 5 = 3 forms a triangle). We defer this 
discussion to later work. 

B. Theoretic Framework 

When the field £ is stationary and ergodic with respect to 
time t, the empirical correlation function, in the limit of T — > 
oo, converges to its expected value (e.g., see Theorem 9.6 of 
[22]), 

k(I) =E& 1 £ J1 ...£ is . (2) 

In many cases, when the sensors form spacially separated 
clusters, I a , Ul a = I, the correlation function also clusters 
correspondingly to be 

hci 

where the approximation is good when the distances between 
the points of different ij's are large. 

Correlation functions give rise to cumulants. Cumulants 
{c;}z>i are defined as satisfying 

z i 

where ki = «({1, . . . , I})) are the correlations. The cumulant 
functions are nonvanishing at the diagonal (that is for the sets 

2 We concentrate on the plane localization problem in this note; our approach 
can be used in a high dimensional space or generally, an abstract metric space. 
For example, the sensors can be viruses planted at unknown nodes of a virtual 
networks unseen from behind a firewall. 



of points spatially close) and vanishing far from the diagonal 
(that is where the pairwise distances between the points grow), 
e.g., see [23]. 

We will concentrate on the pairwise cumulants, which 
reduce to the standard statistical correlation 



(5) 



for sensor i and j. The notation c{zi,Zj) = 02(1, j) may also 
be used when the exact locations of sensor i and j need to be 
stated. 

Remark 3: Notice that c(i,i) = Var(^) > and also 
c(i,i) > \c(i,j)\,Zj 7^ Z{. Further, if the correlation is 
continuous function of Zi,Zj, c(i,j) > for sensor i and 
sensor j close enough. 

The cumulants described above capture the dependence 
between the instantaneous values of random fields. One can 
also exploit also the dependence between different time values 
at different points. A general way to do so is to extend the 
dimension of the random field £. Consider the new field £a 
whose value at point z and time t is the trajectory of £ between 
t - A and t + A: 



fOM) = UO,s)} se [ t -A,t+A]- 



(6) 



The 2-point correlation functions involve in this case some 
general kernel functions K(s, s') and are given by 



(££'> = 



-A, A] 2 



€(a)K(s,s')Z(s')dsds'. 



(7) 



In Section |ni| we will present several examples where the 
values of empirical cumulants can serve as an efficient proxy 
for the internode distances, thus effectively resolving the first 
stage of the localization problem for the anchorite sensor 
networks. 

C. Proximity graph 

To solve the localization problem, we need construct the 
proximity graph. We assume a standard setting: in a plane 
region A C R 2 , N points are selected independently and 
uniformly with respect to Lebesgue measure. We will assume 
throughout the note that the area of A is normalized to the 
unity. 

We will reconstruct the positions of the nodes with respect 
to the first B < N points (called local beacons), whose 
coordinates are assumed to be known 3 . The positions of the 
rest sensors are unknown, and are to be determined. 

If the mutual distances di j = 1 2j — Zj | are known for all pairs 
of sensors, it is possible to reconstruct the whole configuration 
Z = {z%, . . . , Zn} up to an isometry of the plane preserving 
the positions of beacons (that is up to a rotation if B = 1 or 
up to an axis symmetry if B — 2). Obviously, the positions 
and the number of beacons have impact on the reconstruction. 

In an anchoritic network, the sensors do not know their 
mutual displacements, and we resort to the measurements of 



(empirical) correlations/cumulants as a proxy for the intern- 
odes distances. 

Denote the empirical correlation function of sensor i and 
sensor j by c(z%, Zj), and empirical correlation by S- T \zi, Zj). 
We will make the assumption that, with the sample size large 
enough the cumulants c(z%, Zj) for all pairs can be ap- 

proximated with desired precision by the empirical cumulants. 

To construct the proximity graph T at we connect each node 
i to fcjv nodes having the largest values of the empirical 
cumulants. For an ergodic field such that 

c(z, z) > c(z, z'), z 7^ z', 

and k]y — o(N), the resulting graph will approximate 
the corresponding fcjv- neares t neighbor graph based on the 
Euclidean distances between nodes 4 . 

We need the cumulants to satisfy the following two assump- 
tions. 

A: Convergence of empirical cumulants: c- T \zi,Zj) — > 
c(zi, Zj) for all !,j€NasT^ 00, and, 

B: Asymptotic isotropy: for all x,y € A, the set S X (S) = 
{y : c(x,y) > c(x) — 6} satisfies 



a unit circle, 



(8) 



y/Area(S x (6)) 
as 5 -> 0. 

Remark 4: Intuitively speaking, Assumption B means that 
the left hand side of l|8} can be sandwiched between two circles 
with radii arbitrarily close to 1, as S — > 0. 

Then , the following result is valid. 

Theorem 1: Under Assumptions A and B, let fcjy = 
log N c , c > 1, then, for any i, 



(9) 



is the 



for any b = 1 . . . , B, with high probability. Here h\. 
distance between nodes i and j in T^r (hop distance) and dij 
is the Euclidean distance between Zi and Zj. 

Proof: First, we need prove the following result. Con- 
necting each pair of sensors with a distance less than 



r(N) 



(log NY 



irN 



c > 1, 



(10) 



one obtains a graph G(N). Let h\j be the hop distance 
between nodes i and j in G(N). Then, for any e > 0, we 
have 



lim I 

N— >oo 



h^r(N) 



1. (11) 



The argument goes as follows. Choose p(N) = y Cll °^ N 
where c\ > 2.5. For sensor i and j with positions Zi and Zj, 
connect them by a sequence of circles of radius p(N) such 
that the centers of the adjacent circles are r(N)~2p(N) away 
(when N is large, r(N) — 2p(N) > 0). We call sensor i and 



3 This is not a departure from the assumptions of anchoritic nodes: these 
special sensors do not broadcast any specially structured signals. 



4 A fcjv -nearest neighbor graph is build on a metric space in such a way 
that each vertex is connected to k nearest vertices. 



j to be p-vicinity connected if there exists at least one sensor 
lying in each of the circles along the line (Zi,Zj). This is 
shown in Figure ^ 



sensor j 



- r-2 P - 

Fig. 1. p- vicinity connection 

Therefore, for any i, j 6 N and large N, we have 

P[ i,j are not p- vicinity connected] 

= P[at least one circle along (Zj, Zj) is empty] 
< E[number of empty circles along (Z i: Zj)] 
V2 



< 



r-2p 
r-2p 



P[one given circle is empty] 

(1-npT 



2ttN 



V (log^) e 



-N~ 



(12) 



implying, 



i,j are p-vicinity connected, Vi, j £ N] 

> 1 — P[ i,j are not p- vicinity connected 



> 1 — I I n 

,2 



Passing N to infinity, we obtain 



(13) 



lim P[ i,j are p-vicinity connected, Vi, j S N] = 1. 

AT— >oc 



Hence 



lim ] 



< e,VZ u Zi e Z 



> lim P[ i,j is p-vicinity connected, Vi, j € N] 

JV— «X) 

= 1- (14) 

Next, choosing ri(iV) = — e ■ r(N) and r 2 (N) = 
VI + e ■ r(N), we obtain graph G~ e (N) and G +€ (N), re- 
spectively. Actually, we can prove that P[G~ e (N) C T(N) C 
G e (N)] — > 1, when N — > oo, and the argument goes as 
follows. 

In graph G~ e (N), we define for sensor i the indicator 
function 

Xij = l(sensor i and sensor j are connected). (15) 
For all j 7^ i, {X^} are i.i.d random variables. Let Yi be the 



number of neighbors in G e (N) for sensor i. Then, 



[Yi > k N ] = 



> 



iV 



vT 



=E[Xi 



(16) 



and, from well-known large deviation results, there exists > 
such that 

P[F, > fcAr] < e - 0N . 



(17) 



Therefore, 

P[Yi > k N , for some ie N] < 



oo, 



< AT -e 
^0 as iV- 

From Jl 8I > and condition we have as iV 
P[G" £ (A) c r(jv)] -> 1. 
By the same argument, as A — * oo, we can also prove 
P[T(iV) C G e (N)} -> 1. 
Combining (II It . (1191 and (I20> . then passing <5 — * 



(18) 

(19) 

(20) 
0, we 



complete the proof. ■ 
From this theorem, we see that knowing the cumulants is 
enough to reconstruct, with arbitrary precision, the positions 
of all nodes in the networks, assuming that the network is 
large enough. 

In the next section we consider several models for various 
"real-life" inspired ergodic random fields which could be used 
for the distance estimation purposes. 

III. Models of Random Background Fields 

In this section, the sensors are assumed scattered uniformly 
in an open space. For the case when the sensors are not 
uniformly distributed, one might need introduce local mas- 
ters/sinks. We will discuss the ramifications in Section [Tvl 

A. Boolean Model 

This model imitates an anchoritic sensor network with the 
shadow/light patterns used as the auxiliary random field. The 
shadow/light detection can be done in an inexpensive fashion 
in terms of both hardware costs and energy consumption. 

To model the shadow patterns generated by "clouds" (the 
clouds here can be real clouds, or artificially introduced 
acoustic noise signals or electromagnetic disturb), we will 
apply the widely used Boolean model (see e.g. [24]). The 
model is specified by a point process V and a class of bounded 
random sets B. To keep matters simple, we assume that V is 
a Poisson point process, and the B £ B is a circle with a 
random radius R. Given the pair (V^B), the random set C is 



C= [J (Z a + B a ), 



z a ev 



where B a are iid realizations of the sets from B. 

At each instant t, the sensors located in the field observe 



1 if z e C and 
otherwise. 



For thus constructed random field £, the correlation function 
between any 2 points z, z' can be explicitly computed. Let 
V(d(3) be the probability measure on the shape space B. For 
any (3 e B and z£R 2 define 

^(z) = |/3n(/3 + z)| 

to be the Lebesgue measure of the intersection of shape and 
its displacement by z. For example, if shape is a ball of radius 
r, then, for any \z\ < 2r, 



ip(z) = 2 ar 

and, therefore, one has 
Proposition 1: 

c(z,0=E£(z)E£«) 



|*| \z\^r*-\z\* 



2r 



(21) 



and 



-l"lp(d/?). 



¥(dj3) - 1 



If the time intervals between each measurement are spaced 
far enough, one can model the realizations of the random sets 
Ct,t = 1, . . . , T as iid, and the empirical correlation function 
between any two points z, z' converges to c(z, z 1 ). Applying 
formulae of Proposition ^ one can recover the internode 
distance data. 

B. Large Clouds 

A variant of the Boolean model deals with the unbounded 
shapes, the "large clouds". Here the clouds are represented 
as the parallel strips of random widths. More precisely, we 
consider the random set C to be bounded by a family of 
parallel lines, which are orthogonal to a direction that is 
chosen uniformly from the unit circle, and whose orthogonal 
projections to this direction form a Poisson point process of 
some constant intensity (one can check that this definition is 
independent of the choice of the origin in the plane). 

In other words, z <E C if 



X2i < (z,e) < x 2i+ i,i 



-1,0,1,. 



where (,) denote Euclidean scalar product, e is a random 
vector chosen uniformly from {|e| = 1} and {ife}!^ is a 
Poisson point process with constant intensity A. 

Again, in this situation it is easy to find the 2-point 
cumulant. In fact, if a random set is translation and rotation 
invariant, and each realization of the set is a union of infinite 
strips, then 

Proposition 2: 



c(z,z') = 

for some constants a. b > 0. 



b\z - z'\ 



Once again, we see that the cumulant attains its maximum 
on the diagonal, and therefore can be used for recovering the 
distance data in anchoritic sensor networks. 

C. Random Walkers 

This model describes the random field generated by some 
independently moving objects: as an example, one can imag- 
ine sensors registering heat/verberation of an animal moving 
nearby. 

To model this, we consider random field represented by a 
family of random walkers in the area A. At time t a sensor 
located at position z registers £ = 1 if there is a walker at a 
distance at most r from z. 

We will use the spatio-temporary version of the correlation 
function, that is we will consider also the lagged correlations 



^E£(M)£(z',i- 



< A. 



(22) 



This trick stretches the cumulants of close points over the 
interval A. The precise expression for the cumulants in this 
model is a polynomial in Gaussian functions and is quite 
cumbersome, so we will not present it here. We just state 

Proposition 3: The 2-point cumulant function for the ran- 
dom walkers model is positive on the diagonal and vainishing 
at the infinity. 

IV. Engineering Ramifications 

The idea of anchoritic sensors can be extended to situations 
where heterogenous sensors are combined. Some sensors are 
powerful to chat and can form the backbone of local mas- 
ters/sinks, while other sensors are so weak that they can only 
collect measurements and report them to a nearest sink. The 
network architecture is depicted as in Figure |2] 



cluster 



□ local master • * ra • ! El • 1 ED • 

* f •• 

Fig. 2. Combine Deaf Sensors and Local Sinks 



The sinks are powerful nodes that can exchange information 
and, if possible, can even engineered to be able to generate 
radio or acoustic signals at random time with random strength. 
These auxiliary signals will be measured by other emaciated 
sensors. This measuring procedure can be simply record or 
1 by judging the strengths of the signals. 

Sensors in a cluster periodically report their data to the 
local master/sink. Using the correlation values, the local master 
can estimate the positions of the sensors in its cluster. For 
this structure, the sensors are only required to be uniformly 



scattered within a cluster, and the density can change for 
different clusters. 

Once again, the distinguishing feature of this approach is 
that it does not require the broadcasting of globally structured 
signals from a central entity. 

V. Experimental Results 

In this section we present simulation experiments on prox- 
imity graph reconstruction in anchoritic networks, based on 
several random fields described above. In all these simulations, 
TV = 1000 sensors were chosen independently at random from 
uniform density in the unit square A, and the simulation goes 
in a discrete fashion t — 1,2, ••• ,2000. These sensors are 
represented as little squares on the corresponding plots. The 
proximity graph T is formed by connecting a given number 
(fc/v = L(l°g ^V) X - 2 J = 10) of pairs of nodes with the largest 
value of the empirical cumulants (the reason why we choose 
/sjv = (log TV) 12 comes from the proof of Theorem 1). 

A. Round clouds — Boolean model 

In this simulation, round clouds of random radii uniformly 
distributed on [0, .2] are modeled as a Poisson random field 
with density 30. Blue circles on Figure [3] depict a realization 
of the Boolean model; the empirical cumulants were formed 
from T = 2000 independent samples. 

The proximity graph T shown on Figure [5] is formed by 
connecting a given number (kpj = 10) of pairs of nodes with 
the largest value of the empirical cumulants. one can see that 
this graph strongly resemble a nearest neighbor graph: there 
are very few edges connecting nodes far away, and all pairs 
of close nodes are connected. 




Fig. 3. Boolean model with round clouds. 



In this example, on each corner of the unit square, a beacon 
with known position (illustrated by a red square) is shown. 
Though only four beacons are given, based on the proximity 
graph, we still can give a reasonably good estimation of 



locations for most of the sensors. This result is discussed in 
Section EH 

B. Big clouds 

The big clouds in this simulation were modeled by half 
planes (a realization is shown in blue on Figure^ bounded by 
lines with isotropic orientation. Again, a given number (k^ = 
10) of pairs of nodes with the largest value of the empirical 
cumulants were selected; a visual inspection indicates high 
similarity of this graph with the nearest neighboring graph. 




Fig. 4. Random sets are formed by isotropic half-planes. 



C. Random walkers 

We consider here W = 10 random walkers, which are 
monitored by sensors of a sensing radius r = .13. Yellow 
trajectories show part of the traces of the walkers. 

Consider the lagged correlations, with lags equal to A = 2. 
For each sensor we then select fcjv = 10 neighbors with the 
best values of empirical cumulants. 

One can see on Figure [5] that the quality of the proximity 
graph in this situation is almost the same as that in the previous 
two examples, even in the presence of the relatively slow 
convergence of the random fields defined by the walkers: to 
ensure the convergence of the empirical cumulants to their 
average values one needs at least the convergence of the 
(normalized) occupation measure to the uniform measure. 
While from Figure |5] it is evident that the occupation measure 
in the simulated example is still far away from the uniform 
measure (which can be seen from the scatter plot of cumulants 
illustrated in Figure [8] of Section IV-D> . Yet, even given this 
source of imperfection, the result can be reliably used to 
estimate the proximity of the nodes. 

D. Quality of distance approximation 

A better feeling about the quality of the distance data 
recovery using the correlation between nodes can be gained 
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pairs of points are shown. 



Fig. 5. Part of the traces of W = 10 random walkers (reflected at the 
boundary) are shown in yellow. The region in which the walkers move 
is larger than the region A where the sensor network is deployed, to 
avoid irregularities at the boundary. 



from the scatter plots, which show the cumulant values versus 
the distances of pairs of sensors. The first two scatter plots, 
the round clouds model in Figure|5]and the large clouds model 
in Figure Q show the results for all pairs of sensors. 
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Fig. 6. Cumulant-distance scatter plot for the round clouds model. 
All pairs of points are shown. 



For the random walkers model, the scatter plot (Figure 
|8jl shows the cumulant-distance pairs with one of the nodes 
fixed. The heterogeneity of the occupation measure leads to 
significantly different ranges of the cumulants at different parts 
of the region. However, for any particular node, the cumulants 
can be very efficiently used for selecting the closest nodes, as 
the plot shows. 



Fig. 8. Scatter plot of cumulants versus distances for all pairs of 
sensors with one fixed: blue dots show the data for one node in the 
region of high occupation density; red dots correspond to another 
node with low occupation density. The plots are visibly similar, 
exhibiting high reliability of cumulant estimator. 



E. End-to-end localization in anchoritic sensor networks 

For the round clouds model, we used the cumulant-based 
proximity graph to approximate the internode distances and 
ultimately to reconstruct the positions of the sensors. Since the 
second step of reconstructing exact positions is not the main 
focus of this paper, although it is very important for localiza- 
tion, we choose to do it in a rather naive way. We compute 
the hop distances of the sensor of interest to the four beacons 
lying on the four corners using Dijkstra's algorithm. Assuming 
the hop distance is proportional to the real Euclidean distance 
(which is shown in Theorem 1), we can estimate the locations 
of all sensors, and the results are shown in Figure lV-El One can 
see that boundary effects (caused by the inefficient algorithm 
we used here) are rather significant, yet in the interior of the 
area the positions are recovered quite well. 

Here we did not use any sophisticated machinery for the 
second step of the localization problem, and a more holistic 




Fig. 9. Results of the end-to-end positions estimation in a anchoritic 
sensor network. Round clouds cumulants were used to generate the 
proximity graph. The actual node positions are shown as yellow 
squares connected to their estimated positions (green squares). 



approach would be to generate a Gibbs measure, the ensemble 
of N nodes in A whose distribution would consist with 
the empirical measurements. Sampling from this distribution 
would give the most probable positions of the sensors in the 
region A. 

VI. Conclusion 

As demonstrated above, the cumulant evaluation technique 
allows one to address the localization problem in anchoritic 
sensor networks, where the chatter between the nodes is 
undesirable or infeasible. We would like to indicate several 
research directions prompted by the present study, which we 
hope to pursue: 

1) We concentrated in this study on pairwise cumulants. 
This fits into the customary context of the existing 
approached to the localization problem in restricting 
attention to pairwise mutual positions of the nodes. Yet, 
the study of correlations allow one to gain much more 
information by considering the higher order cumulants. 
For example, in the big clouds models, the leading non- 
constant term in the fc-order cumulant is proportional 
to the circumference of the convex hull of the nodes 
for which the cumulant is computed. Finding system- 
atic ways to use this information is a challenging and 
promising task. 

2) We approach the second step of the localization problem 
— finding the positions from the cumulant data — by 
generating the proximity graph. More holistic approach 
would be to generate a Gibbs measure, the ensemble of 
N nodes in A whose distribution would consist with the 
empirical measurements. Sampling from this distribution 
would give the most probable positions of the nodes in 
the region A. 
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